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METHOD AND SYSTEM FOR GRID -ENABLED VIRTUAL MACHINES 
WITH DISTRIBUTED MANAGEMENT OF APPLICATIONS 



CROSS-REFERENCE TO RELATED APPLICATIONS 

5 The present invention is related to the following 

patent application, hereby incorporated by reference: 
"Method and System for a Grid-enabled Virtual Machine With 
Movable Objects", U.S. patent application serial no. 
(attorney docket no. AUS920030459US1) , filed (TBD) . 

10 

BACKGROUND OF THE INVENTION 



Field of the Invention 

The present invention relates to an improved data 
15 processing system and, in particular, to a method and 

apparatus for multicomputer data transferring and program 
execution. 



Description of Related Art 

20 The widespread adoption of Internet-related and 

Web-related technology has enabled the growth of a global 
network of interconnected computers that are physically 
supported by many thousands of organizations and 
businesses and many millions of individuals. Recently, 

25 enterprises have exerted effort to organize the 

computational capacity of many computers into a grid, 
which is a logical organization of many computers for 
providing a collective sharing of computational capacity 
and datastore capacity while maintaining local autonomous 

30 control over individual computers. 
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Many enterprises are cooperating within the Global 
Grid Forum™, which is supported by GGF, Inc., to develop 
community-driven standards for grid computing. The 
Globus Project™ is one effort that has been supported by 
5 government agencies, corporations, and universities to 

develop grid-related technologies, which has resulted in 
the development of the Open Grid Services Architecture 
(OGSA) , which is an initiative for incorporating grid 
concepts within a service-oriented framework based on Web 

10 services. The Globus Toolkit* is an implementation of the 
Open Grid Services Architecture that provides software 
programming tools for developing grid-enabled 
applications, and the Java* Community Grid (CoG) Kit is 
the portion of the Globus Toolkit* that provides access to 

15 grid services through a Java* framework. Although the 
Java* CoG Kit supports a Java* application in accessing 
grid services, the Java* CoG Kit does not incorporate the 
computational characteristics of a grid into a Java* 
virtual machine (JVM) . 

20 Therefore, it would be advantageous to have a 

method, a system, an apparatus, and/or a computer program 
product that incorporates computational characteristics 
of a grid within a virtual machine. It would be 
particularly advantageous to perform object management 

25 within a virtual machine such that the virtual machine 
may be implemented within a dynamically changing, grid 
computing environment. 
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SUMMARY OF THE INVENTION 



A method, an apparatus, a computer program product, 
and a data processing system provide for the operation of 
5 a virtual machine with embedded functionality for 
interoperating with other virtual machines in a 
computational grid. A plurality of virtual machines are 
run on one or more devices within a data processing 
system; each virtual machine in the plurality of virtual 

10 machines incorporates functionality for interoperating 

and associating with other virtual machines in a virtual 
machine cluster in a grid- like manner. Each virtual 
machine in the virtual machine cluster acts as a node 
within the virtual machine cluster. The virtual machines 

15 share information such that a virtual machine may be 
added to the virtual machine cluster or such that a 
virtual machine may be removed from the virtual machine 
cluster as the virtual machine cluster continues to run, 
thereby allowing the virtual machine cluster to act as 

20 one logical virtual machine. Load-balancing operations 
may be performed within the virtual machine cluster, and 
threads and objects may be moved within the virtual 
machine cluster. A single application image conforming 
to a typical application model can be executed within the 

25 grid-enabled virtual machine cluster in a way that 

enables the execution of the application to span multiple 
instances of a grid-enabled virtual machine, thereby 
allowing the execution of the application to span 
multiple devices. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The novel features believed characteristic of the 
5 invention are set forth in the appended claims. The 
invention itself, further objectives, and advantages 
thereof, will be best understood by reference to the 
following detailed description when read in conjunction 
with the accompanying drawings, wherein: 
10 FIG, lA depicts a typical network of data processing 

systems, each of which may implement the present 
invention; 

FIG. IB depicts a typical computer architecture that 
may be used within a data processing system in which the 
15 present invention may be implemented; 

FIG. 2 depicts a block diagram that shows a virtual 
machine cluster in a distributed data processing system 
in accordance an implementation of the present invention; 

FIG. 3A depicts a block diagram that shows 
20 components and/or information that may be used to enhance 
a virtual machine as a clusterable virtual machine in 
accordance with an implementation of the "present 
invention; 

FIGS. 3B-3D depict a set of block diagrams that 
25 shows the transference of object groups between clustered 
virtual machines ; 

FIG. 4 depicts a flowchart that shows a process for 
initializing a primary gJVM node; 

FIG. 5A depicts a flowchart that shows a process at 
30 a primary gJVM for allocating a unique gJVM identifier to 
a non-primary gJVM that is being initialized; 
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FIG. 5B depicts a flowchart that shows an 
alternative process at a primary gJVM for allocating a 
unique gJVM identifier to a non-primary gJVM that is 
being initialized; 
5 FIG. 6 depicts a flowchart that shows a process at a 

primary gJVM for allocating a unique object group 
identifier to a non-primary gJVM; 

FIG. 7 depicts a flowchart that shows a process at a 
primary gJVM for providing information about the gJVM 
10 cluster to a non-primary gJVM; 

FIG. 8 depicts a flowchart that shows a process at a 
primary gJVM for changing a run state of a non-primary 
g JVM ; 

FIG. 9 depicts a flowchart that shows a process at a 
15 primary gJVM for changing a migration state of an object 
group / 

FIG. 10 depicts a flowchart that shows a process for 
initializing a non-primary gJVM node; 

FIG. 11 depicts a flowchart that shows a process on 
20 a clustered virtual machine during which a thread is 
constructed; 

FIG. 12 depicts a flowchart that shows a process on 
a clustered virtual machine during which a thread is 
destroyed; 

25 FIG. 13 depicts a flowchart that shows a process on 

a clustered virtual machine during which an object is 
constructed; 

FIG. 14 depicts a flowchart that shows a process on 
a clustered virtual machine for handling accesses to 
30 objects; 
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FIG. 15 depicts a flowchart that shows a process on 
a local clustered virtual machine for migrating an object 
group to a remote clustered virtual machine; 

FIG, 16 depicts a flowchart that shows a process on 
5 a local clustered virtual machine for accepting the 
migration of an object group from a remote clustered 
virtual machine; 

FIG. 17 depicts a flowchart that shows a process on 
a local clustered virtual machine for migrating a thread 
10 to a remote clustered virtual machine; 

FIG. 18 depicts a flowchart that shows a process on 
a local clustered virtual machine for accepting the 
migration of a thread from a remote clustered virtual 
machine ; 

15 FIG. 19 depicts a flowchart that shows a process on 

a local clustered virtual machine for performing a 
shutdown operation on a clustered virtual machine; 

FIG. 20 depicts a flowchart that shows a process on 
a local clustered virtual machine for gathering load or 

20 performance information from a remote clustered virtual 
machine; 

FIG. 21 depicts a flowchart that shows a process on 
a local clustered virtual machine for responding to a 
request from a clustered virtual machine for the current 
25 load or performance information at the local clustered 
virtual machine; 
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FIG. 22 depicts a flowchart that shows a process on 
a local clustered virtual machine for determining to 
migrate threads and/or object groups from the local 
clustered virtual machine to a remote clustered virtual 
5 machine in accordance with load or performance 
information; and 

FIG. 23 depicts a block diagram that shows the 
execution of a single application image that spans 
multiple grid-enabled virtual machines in a single 
10 logical virtual machine over multiple devices. 
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DETAILED DESCRIPTION OF THE INVENTION 

5 In general, the devices that may comprise or relate 

to the present invention include a wide variety of data 
processing technology. Therefore, as background, a 
typical organization of hardware and software components 
within a distributed data processing system is described 

10 prior to describing the present invention in more detail. 

With reference now to the figures, FIG. lA depicts a 
typical network of data processing systems, each of which 
may implement a portion of the present invention. 
Distributed data processing system 100 contains network 

15 101, which is a medium that may be used to provide 

communications links between various devices and computers 
connected together within distributed data processing 
system 100. Network 101 may include permanent 
connections, such as wire or fiber optic cables, or 

20 temporary connections made through telephone or wireless 
communications. In the depicted example, server 102 and 
server 103 are connected to network 101 along with storage 
unit 104. In addition, clients 105-107 also are connected 
to network 101. Clients 105-107 and servers 102-103 may 

25 be represented by a variety of computing devices, such as 
mainframes, personal computers, personal digital 
assistants (PDAs), etc. Distributed data processing 
system 100 may include additional servers, clients, 
routers, other devices, and peer-to-peer architectures 

30 that are not shown. 
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In the depicted example, distributed data processing 
system 100 may include the Internet with network 101 
representing a worldwide collection of networks and 
gateways that use various protocols to communicate with 
5 one another, such as Lightweight Directory Access Protocol 
(LDAP) , Transport Control Protocol/Internet Protocol 
(TCP/IP) , Hypertext Transport Protocol (HTTP) , Wireless 
Application Protocol (WAP) , etc. Of course, distributed 
data processing system 100 may also include a number of 

10 different types of networks, such as, for example, an 
intranet, a local area network (LAN), or a wide area 
network (WAN) . For example, server 102 directly supports 
client 109 and network 110, which incorporates wireless 
communication links. Network-enabled phone 111 connects 

15 to network 110 through wireless link 112, and PDA 113 

connects to network 110 through wireless link 114. Phone 
111 and PDA 113 can also directly transfer data between 
themselves across wireless link 115 using an appropriate 
technology, such as Bluetooth™ wireless technology, to 

20 create so-called personal area networks (PAN) or personal 
ad-hoc networks. In a similar manner, PDA 113 can 
transfer data to PDA 107 via wireless communication link 
116. The present invention could be implemented on a 
variety of hardware platforms; FIG. lA is intended as an 

25 example of a heterogeneous computing environment and not 
as an architectural limitation for the present invention. 

With reference now to FIG. IB, a diagram depicts a 
typical computer architecture of a data processing system, 
such as those shown in FIG. lA, in which the present 

30 invention may be implemented. Data processing system 120 
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contains one or more central processing units (CPUs) 122 
connected to internal system bus 123, which interconnects 
random access memory (RAM) 124, read-only memory 12 6, and 
input/output adapter 128, which supports various I/O 
5 devices, such as printer 130, disk units 132, or other 
devices not shown, such as an audio output system, etc. 
System bus 123 also connects communication adapter 134 
that provides access to communication link 136. User 
interface adapter 148 connects various user devices, such 
10 as keyboard 140 and mouse 142, or other devices not 

shown, such as a touch screen, stylus, microphone, etc. 
Display adapter 144 connects system bus 123 to display 
device 146. 

Those of ordinary skill in the art will appreciate 

15 that the hardware in FIG. IB may vary depending on the 

system implementation. For example, the system may have 
one or more processors, such as an Intel® Pentium®-based 
processor and a digital signal processor (DSP) , and one 
or more types of volatile and non-volatile memory. Other 

20 peripheral devices may be used in addition to or in place 
of the hardware depicted in FIG. IB. The depicted 
examples are not meant to imply architectural limitations 
with respect to the present invention. 

In addition to being able to be implemented on a 

25 variety of hardware platforms, the present invention may 
be implemented in a variety of software environments. A 
typical operating system may be used to control program 
execution within each data processing system. For 
example, one device may run a Unix® operating system, while 

30 another device contains a simple Java® runtime environment. 
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A representative computer platform may include a browser, 
which is a well known software application for accessing 
hypertext documents in a variety of formats and languages, 
such as graphic files, word processing files, Extensible 
5 Markup Language (XML) , Hypertext Markup Language (HTML) , 
Handheld Device Markup Language (HDML) , Wireless Markup 
Language (WML) , and various other formats and types of 
files . 

10 Virtual Machine Cluster/Grid 

The present invention may be implemented on a 
variety of hardware and software platforms, as described 
above with respect to FIG. lA and FIG. IB. More 
specifically, though, the present invention is directed 

15 to an improved virtual machine that includes 

computational characteristics of a grid within the 
improved virtual machine such that the improved virtual 
machine may interact with similar improved virtual 
machines in a grid-like manner, as described with respect 

20 to the remaining figures. Although the following figures 
are described as incorporating Java* technology such that 
the present invention may be implemented as improved Java* 
virtual machines (JVMs) , the present invention may be 
implemented in conjunction with technology for other 

25 types of virtual machines. 

With reference now to FIG. 2, a block diagram 
illustrates a virtual machine cluster/grid in a 
distributed data processing system in accordance with an 
implementation of the present invention. Network 200 

30 connects a set of virtual machines 201-205; network 200 

is similar to network 101 in FIG. 1, and virtual machines 
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201-205 may be supported on a variety of devices, such as 
any of the devices that are shown in FIG, 1. 

By way of introducing terminology, a cluster is a 
set of two or more interacting entities. A cluster of 
5 virtual machines may be referred to as a virtual machine 
cluster, a virtual machine grid, or a logical virtual 
machine. A single virtual machine that contains 
functionality for operating in a cluster/grid may be 
referred to as a clusterable virtual machine or a 
10 grid-enabled virtual machine. A single virtual machine 

that is included within a cluster/grid may be referred to 
as a clustered virtual machine or a grid-embedded virtual 
machine . 

As noted above, the present invention may be 

15 implemented within a Java* environment using Java® virtual 
machines that have been enhanced to include functionality 
for implementing an embodiment of the present invention. 
Alternatively, the present invention may be implemented 
within extensions to Java* class libraries. Thus, the 

20 terminology that is provided above may be used within a 
Java* context. Hereinafter, a grid-enabled JVM or a 
grid-embedded JVM is referred to as a gJVM. FIG. 2 shows 
network 200 connecting a gJVM cluster that contains 
gJVM's 201-205. 

25 A single clustered virtual machine or a single 

grid-embedded virtual machine may be referred to as a 
node within a given virtual machine cluster, a given 
virtual machine grid, or a given logical virtual machine. 
In the example that is shown in FIG. 2, a set of virtual 

30 machines interact as multiple nodes within a single 
virtual machine cluster in a grid-like manner. The 
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present invention supports a plurality of nodes in a 
virtual machine cluster, and a distributed data 
processing system may support more than one virtual 
machine cluster/grid. Moreover, a given device may 
5 support more than one clusterable/grid-enabled virtual 
machine or more than one clustered/grid-embedded virtual 
machine. In other words, a given device may support a 
first clustered/grid-embedded virtual machine and a 
second clustered/grid-embedded virtual machine, wherein 

10 the two clustered/grid-embedded virtual machines are 

included in two distinct and independent virtual machine 
clusters/grids; the two clustered/grid-embedded virtual 
machines and the two virtual machine clusters/grids would 
be uniquely identifiable and accessible. 

15 Depending on the type of process that is being 

performed within a given virtual machine cluster/grid, it 
may be convenient to refer to interacting nodes within 
the given virtual machine cluster/grid as a local node or 
a remote node in order to distinguish between nodes. For 

20 example, from the perspective of a given gJVM, the given 
gJVM may be referred to as the local node, the local 
gJVM, or the local gJVM node; from the perspective of a 
given gJVM, a different gJVM that is interacting with the 
local node, whether on the same device or whether 

25 executing on another device, may be referred to as a 
remote node, a remote gJVM, a remote gJVM node. 

The present invention is directed to transparently 
allowing an application to run on a set of machines or 
devices, each of which support at least one clustered 

30 virtual machine node. For example, a multithreaded Java* 
application may run over multiple gJVM's, and the 
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application would not be aware nor need be aware on which 
machines its threads were dispatched. The virtual 
machine cluster dynamically manages and distributes the 
threads of the application upon the available set of 
5 clustered virtual machines. 

Each virtual machine cluster has a single primary 
node. In FIG. 2, gJVM node 2 01 is a primary gJVM node, 
and gJVM nodes 202-205 are non-primary gJVM nodes. The 
primary node may have certain duties or responsibilities 

10 that are performed for the non-primary nodes within the 

cluster. For example, the primary node may have a set of 
objects that define the application environment and 
context for the application, such as for file access and 
class loading (classpath) for the virtual machine 

15 cluster. 

Primary nodes and non-primary nodes may be started 
in slightly different manners, thereby providing an 
indication to a given clusterable virtual machine that it 
is to act as a primary node or as a non-primary node. 

20 For example, a clusterable virtual machine in a Java* 

environment may be started with the following exemplary 
instantiation command: 

java [-main -clusterName] [mainClass] 
The ''java" command may instantiate a clusterable virtual 

25 machine, i.e., start the execution of a given copy of a 
clusterable virtual machine. The ''-main" parameter 
indicates that a particular instance of a gJVM is to act 
as a primary gJVM node within a gJVM cluster that is 
identified by the "-clusterName" parameter; when a 

30 particular instance of a gJVM is to act as a non-primary 
gJVM node, the instantiation command would not include 
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the ''-main" parameter. The "mainClass" parameter 
specifies the main program to be run by the gJVM. In 
this manner, the software modules that comprise a primary 
node and a non-primary node may be copies of each other, 
5 thereby making the software essentially similar, i.e., a 
primary node and a non-primary node may be so-called 
peers or clones, although a primary node and a 
non-primary node may exhibit different behavior due to a 
primary node undergoing a slightly different 

10 instantiation process than a non-primary node. 

Since a primary node and the non-primary nodes may 
simply be independent instances of a clusterable virtual 
machine, wherein the primary node and the non-primary 
nodes merely function in accordance with two different 

15 modes of operation, e.g., by being initialized in 
different manners, a primary node can delegate or 
hand-off its responsibilities to a non-primary node in 
order to enable the primary node to shutdown for 
maintenance or for other purposes after the system has 

20 been started. 

Grid-Enabled Virtual Machine 

With reference now to FIG. 3A, a block diagram 
illustrates components and/or information that may be 

25 used to enhance a virtual machine as a 

clusterable/grid-enabled virtual machine in accordance 
with an implementation of the present invention. FIG. 3A 
depicts gJVM node 300 that represents a single 
clusterable virtual machine, such as any of nodes 201-205 

30 in FIG. 2. Although not shown, gJVM node 300 comprises 
other components and information for performing various 
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tasks in a manner similar to a typical JVM; in other 
words, gJVM node 300 can be considered to be a typical 
JVM that has been enhanced to include the functionality 
of the present invention. Although gJVM node 300 is 
5 shown as containing multiple types of tables, one of 

ordinary skill in the art would understand that different 
types of information and different information formats 
may be managed by a given clusterable virtual machine 
node to effect the functionality of the present 

10 invention. 

P2P communication unit 302 allows each gJVM to 
communicate with any other gJVM in a peer-to-peer 
fashion, particularly to communicate a gJVM's performance 
information to another gJVM. For example, a gJVM may 

15 communicate its CPU load factor, its memory factor, and 

its communication queue load factors to other gJVM's such 
that the virtual machine cluster can balance its overall 
load in a distributed manner. Unique gJVM ID 304 is a 
data value that represents the unique virtual machine 

20 identifier that has been assigned to a particular gJVM 
within a given gJVM cluster that is identified by gJVM 
group name 306. Application environment/context 
parameters 308 define various values and information that 
may be needed during the lifetime of gJVM 300 for 

25 obtaining additional information from its runtime 
environment and for interacting with its runtime 
environment. For example, each gJVM may need to know the 
primary gJVM IP address and port values 310 at which the 
primary gJVM is configured in order to communicate with 

30 the primary gJVM . 
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Each clustered virtual machine has a run state, 
e.g., stopped, starting, started, and stopping. When a 
new node is being instantiated within a virtual machine 
cluster, i.e., added to the plurality of virtual machines 
5 already within the virtual machine cluster, information 
about its existence is disseminated throughout the 
virtual machine cluster; when a particular node within 
the virtual machine cluster is being stopped, i.e., 
removed from the remainder of the plurality of virtual 

10 machines within the virtual machine cluster, which may 
occur without stopping the entire virtual machine 
cluster, information about the stoppage of the particular 
node is disseminated throughout the virtual machine 
cluster. In the example that is shown in FIG. 3A, it is 

15 assumed that each gJVM node maintains a local copy of a 
list of gJVM nodes within the gJVM cluster, which is 
shown as gJVM registry 312/ gJVM node 300 may receive 
updated information about the status of other nodes, 
which is then reflected within gJVM registry 312. In 

20 this manner, gJVM registry 312 provides a registry of 
gJVM's that are known to gJVM node 300. gJVM registry 
312 contains a list of characteristic values for each 
known gJVM, such as its unique gJVM ID 314, its IP 
address and port 316, and its run state 318. Each of the 

25 gJVM's within the gJVM cluster may communicate various 

workload or computational resource utilization parameters 
to each other so that they can determine where to 
dispatch threads. 

After a gJVM has been started, it may accept thread 

30 work from its gJVM cluster; when it is stopping, it moves 
its threads and associated objects to another gJVM. In 
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contrast to a typical virtual machine, the present 
invention provides a clusterable virtual machine in which 
objects are movable between gJVM nodes, thereby allowing 
a thread and its associated objects to be moved when an 
5 instance of a clusterable virtual machine is being 
stopped. 

In a typical prior art system, remote objects are 
addressed via a combination of an IP address, a TCP/IP 
port number, and an object identifier reference; remote 

10 method invocation (RMI) references or Common Object 
Request Broker Architecture (CORBA) references also 
contain other information of relatively large size. In 
contrast to a typical virtual machine in which stub 
classes might be used in a typical RMI or CORBA 

15 implementation, the present invention provides object 
references such that access to objects is managed 
transparently within the gJVM runtime environment without 
stub classes, as explained in more detail below. 

To support movable objects within the present 

20 invention, objects are associated with an object group, 
i.e., logically contained within an object group. As 
long as an object exists, its association with a 
particular object group ensures that the object group 
will continue to exist. In other words, an object group 

25 continues to exist as long as it has at least one object. 
When all of the objects in an object group have been 
garbage collected, then the object group would be garbage 
collected as well. 

Object groups can be moved amongst clustered virtual 

30 machines. An accessible object group exists solely on a 
single clustered virtual machine; access to an object 



AUS920030872US1 

19 

group may be restricted while the object group is in a 
migratory state between clustered virtual machines. 
Hence, each clustered virtual machine has information 
about the location of an object group within the virtual 
5 machine cluster. 

In the example that is shown in FIG. 3A, object 
group registry 320 contains a list of object group IDs 
that are known to gJVM 300 and other associated 
information to locate those object groups within the gJVM 

10 cluster of gJVM 300, thereby associating each known 

object group with a gJVM that is known to gJVM 300. For 
example, a particular object group is represented by 
object group ID 322; the gJVM node on which the object 
group assigned to object group ID 322 is identified by 

15 gJVM ID 324, and the migratory state of the object group 
assigned to object group ID 322 is identified by 
migration state value 326, which may be one of a set of 
constant values, each of which represents a different 
possible migratory state. 

20 In order to provide transparent access and provide 

movability of an object within a virtual machine cluster, 
the internal virtual machine's handle to the object, 
i.e., the object reference, contains a compact set of 
information items that provides for locating and 

25 accessing the object but does not specify the object's 

location. For example, the following Java® structure may 
be used to specify an object handle in a Java* 
environment : 
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Struct ObjectHandle { 

Int ob j ectGroupID; // Unique object group ID 
Byte[] objectID; // Unique object ID of the object 

} 

5 The ObjectHandle that is specified in the above-provided 
example allows an application that is executing within 
the gJVM cluster to access the corresponding object 
anywhere within the gJVM cluster. By knowing the unique 
object group identifier of a group of objects, any gJVM 

10 node can locate the identified object group within the 

gJVM cluster; for example, if the object group is managed 
in a table or other type of datastore that associates the 
object group identifier with a gJVM identifier, then the 
gJVM node on which the object resides can be determined. 

15 By using other information about the manner in which the 
gJVM node is addressed or accessed, the unique gJVM 
identifier can be used to communicate with the gJVM node 
on which the object resides; for example, if the gJVM 
identifier is managed in a table or other type of 

20 datastore that associates the gJVM identifier with an IP 
address and port number, e.g., as shown in gJVM registry 
312, then the appropriate object group on the appropriate 
gJVM node can be accessed in some manner. Although a 
given gJVM node may be identified by an IP address and a 

25 port number, this information is not contained directly 
within an object reference, thereby allowing the object 
to be transparently moved amongst gJVM nodes. By knowing 
the unique object identifier for the object to be 
accessed, then the appropriate object within the object 

30 group on the appropriate gJVM node can be accessed in 
some manner . 
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Information about the locations of the object groups 
may be provided by a JVM singleton object that all gJVM 
nodes within the gJVM cluster can access to lookup the 
association between an object group identifier and the 
5 gJVM node that is hosting the object group. In the 

example that is shown in FIG, 3A, gJVM node 300 contains 
local information provided by object group registry 320; 
e.g., gJVM node 300 may retrieve or receive updates to 
the object group registry 32 0 from another gJVM node, 

10 e.g., the primary gJVM node, thereby allowing gJVM node 
300 to perform quick and efficient local lookups. 

Objects and their object groups are created within 
the heaps of a local gJVM node as necessary. An object 
may be referenced locally until an. object reference is 

15 passed out of thread scope, at which point the object 
would be transparently accessed as a remote object, 
thereby allowing other threads potentially running on 
other machines to access it. In one embodiment, objects 
groups are preferably created in the thread local heaps 

20 of a gJVM node; alternatively, object groups may be 

created within other types of heaps within a clustered 
virtual machine. Each new thread is provided a new 
thread local heap when the thread is constructed; a 
thread may acquire additional thread local heaps as 

25 necessary. A thread local heap provides high performance 
because: most accesses of most objects are local to a 
particular thread; the objects can be easily 
garbage-collected; and a heap lock is not required to 
satisfy object allocations. A thread local heap is 

30 typically implemented as an object of a configurable size 
to which a particular thread has access. To satisfy a 
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typical small object allocation, the thread local heap is 
split at one end to release the required space. If the 
thread local heap is not large enough to satisfy an 
allocation, the remainder of the thread local heap is 
5 discarded and is replaced by a fresh thread local heap. 

Large object allocations may bypass the thread local heap 
mechanism and may be satisfied directly from a larger 
heap within the virtual machine. FIG. 3A depicts a 
single thread 328 within gJVM node 300; thread 328 is 

10 associated with two thread local heaps 330 and 332, 

thereby illustrating that a thread may have a one-to-many 
association with multiple thread local heaps. Thread 
local heap 330 is shown as containing object group 334 
and object group 336, which contain objects that are not 

15 shown in the figure; thread local heap 332 is shown as 
containing object group 338, which itself contains 
objects 340-342. 

Information for managing a thread is stored within 
an object, and this object is also stored within an 

20 object group. Similarly, a thread's stack frame is 

managed as an object. FIG. 3A depicts thread management 
object 344 and stack frame object 346 for managing such 
information. In this manner, the instance data related 
to a thread is stored within an object group. If a 

25 thread is moved from one gJVM node to another gJVM node, 
the instance data is moved as part of the process of 
moving an object group. The thread code itself does not 
need to be stored in an object group; if the code is not 
present within memory, it is available from class files 

30 when necessary. 
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A new object group is also created when a new thread 
is constructed; an association is created between the 
object group and the newly constructed thread, which has 
a unique thread identifier. Thread identifiers are 
5 unique throughout a cluster; each clustered virtual 

machine has a unique virtual machine identifier, thereby 
allowing unique thread identifiers preferably to be based 
on a combination of the unique virtual machine identifier 
and a thread identifier that is unique within a given 

10 virtual machine. 

Additional object groups may be added to a thread as 
necessary. In the example that is shown in FIG. 3A, 
local object group management table 350 contains a list 
of the object groups that are managed within gJVM node 

15 3 00, i.e., the object groups that are stored within gJVM 
node 300, along with other associated information to 
manage those object groups within gJVM node 300. For 
example, a particular object group is represented by a 
row in the table, which contains object group ID 352, 

20 pointer 354 to the thread local heap that contains the 
particular object group, and thread identifier 356 that 
is associated with the object group; if object groups are 
alternatively created within other types of heaps within 
a clustered virtual machine, then different information 

25 would be managed within local object group management 
table 350. 

As mentioned above, an object group may continue to 
exist as long as it has at least one object; even if the 
thread that was responsible for creating an object is 
30 terminated, the object may have been referenced by 
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another thread that still requires its existence, thereby 
requiring the object group in which the object is located 
to continue to exist. Alternatively, master global 
object group 3 60 may be maintained within a gJVM node. 
5 When a thread is ended, all of the remaining objects that 
are contained in its associated object group (or 
associated object groups) that are still ref erenceable 
from other threads, e.g., objects 362-368, are moved to 
the master global object group. The master global object 
10 group would contain all objects that are referenced 
globally within the gJVM cluster even when the 
originating thread has ended, thereby allowing management 
of fewer object groups. 

15 Object Group Movement within Virtual Machine Cluster/Grid 
With reference now to FIGs. 3B-3D, a set of block 
diagrams illustrate the transference of object groups 
between clustered virtual machines in accordance with an 
embodiment of the present invention. FIGs. 3B-3D depict 

20 a set of gJVM nodes within a virtual machine cluster 

across a period of time; FIG. 3B depicts a point in time 
that is earlier than a point in time that is depicted in 
FIG. 3C, which is earlier than the point in time that is 
depicted in FIG. 3D. In a manner similar to FIG. 3A, 

25 FIG- 3B shows gJVM node 300 containing object groups 334, 
336, and 338. In addition, gJVM node 370 contains object 
group 372, and gJVM node 380 contains object group 382. 

As noted above, an object group may continue to 
exist as long as it has at least one object, even if the 

30 thread that was initially associated with the object 
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group is terminated. FIG. 3C depicts the same gJVM nodes 
and the same object groups as FIG. 3B. However, FIG. 3C 
differs from FIG. 3B in that object group 338 has been 
transferred from gJVM node 300 to gJVM node 380; for 
5 example, the transfer may have occurred in response to 
the termination of the thread that was associated with 
object group 338, such as thread 328 that is shown in 
FIG. 3A. 

As also noted above, an object group may continue to 
10 exist as long as it has at least one object, even if the 
clustered virtual machine which had constructed the 
object group has been stopped or terminated. FIG. 3D 
depicts a later point in time with some of the same gJVM 
nodes and object groups as FIG. 3C. However, FIG. 3D 
15 differs from FIG. 3C in that object groups 334 and 336 
have been transferred from gJVM node 300 to gJVM node 
370. For example, the transfer may have occurred in 
response to the termination of gJVM node 3 00 that was 
supporting object groups 334 and 336; when a given gJVM 
20 shuts down, the heaps and/or object groups on the given 
gJVM can be transferred to another machine, and the 
information in the gJVM nodes throughout the virtual 
machine cluster would be updated with the location of the 
object groups. 

25 Garbage collection may also be performed within a 

virtual machine cluster. In a typical Java* virtual 
machine, garbage collection is done with a mark-and- sweep 
approach. In the mark-and- sweep approach, garbage 
collection is invoked either explicitly by an application 

30 or automatically when an application tries to allocate an 
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object but memory utilization causes the object 
allocation operation to be difficult or not possible. 
The virtual machine goes through all objects, marks them, 
then sweeps all threads and statics (global data) to 
5 again mark all objects that can be traversed to. The 

virtual machine pauses all threads and then looks through 
the data structures of each thread's stack frame, whereby 
it marks all objects that are ref erenceable by any 
thread, possibly on an application by application basis. 

10 All objects that are not marked as ref erenceable are then 
destroyed or removed, thereby freeing memory for the 
virtual machine to allocate an object. 

The majority of objects are typically objects that 
are only accessed by one thread. With the virtual 

15 machine cluster of the present invention, an object may 
be referenced remotely; when an object is remotely 
referenced, it is marked as such. In one embodiment, a 
gJVM node may have a separate daemon thread that runs at 
low priority and that periodically traverses the system 

20 to perform remote garbage collection processing. Each 
object may have a remote garbage collection bit that is 
to be used in a mark-and- sweep approach by the remote 
garbage collection daemon. If an object that has been 
remotely referenced is no longer being referenced within 

25 the gJVM node in which it was constructed, then the 
remote garbage collection daemon thread endeavors to 
determine whether the remote object reference still 
requires the existence of the object. The daemon thread 
may send a request message to other gJVM nodes to inquire 

30 whether a particular object that may be ready to be freed 
within the local gJVM is still being referenced by a 
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remote gJVM; these requests may be batched up for 
multiple objects such that groups of references could be 
checked efficiently. If no remote gJVM nodes report that 
a particular object is still being remotely referenced, 
5 then the remote garbage collection daemon on the local 
gJVM node may initiate the objects removal; e.g., the 
remote garbage collection daemon might remove any 
indication that the object is/was remotely referenced, 
thereby allowing the object to be subsequently removed 
10 during the normal/local garbage collection process. 

Initialization of Primary Grid-Enabled Virtual Machine 

With reference now to FIG. 4, a flowchart 
illustrates a process for initializing a primary gJVM in 

15 accordance with an embodiment of the present invention. 

The process commences when the primary clustered virtual 
machine node is initialized (step 402), e.g., through the 
use of the instantiation command that was described 
above. The gJVM cluster name is registered in a manner 

20 that allows applications to lookup available gJVM 

clusters and that prevents name collisions with other 
gJVM clusters (step 404) . The primary gJVM node obtains 
any environment parameters or context parameters that are 
to be employed by the newly started gJVM cluster (step 

25 406) and stores the information in a manner that is 

available to all nodes in the gJVM cluster. The primary 
copy of the object group registry within the primary gJVM 
node is initialized (step 408) , which is available 
thereafter for dissemination as copies to non-primary 

30 gJVM nodes, such as object group registry 320 that is 

shown in FIG. 3A. The primary copy of the gJVM registry 
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within the primary gJVM node is also initialized (step 
410) , which is also available thereafter for 
dissemination as copies to non-primary gJVM nodes, such 
as gJVM registry 312 that is shown in FIG- 3A, thereby 
5 concluding the process. 

Allocation or Registration of Unique IDs For Grid-Enabled 
Virtual Machine Nodes 

With reference now to FIG. 5A, a flowchart 

10 illustrates a process at a primary gJVM for allocating a 
unique gJVM identifier to a non-primary gJVM that is 
being initialized. The process commences when the 
primary gJVM receives a request from a non-primary gJVM 
for a unique gJVM identifier (step 502), The primary 

15 gJVM generates the unique gJVM identifier (step 504) and 
adds the newly generated gJVM identifier to the gJVM 
registry (step 506) . The primary gJVM sets the run state 
of the non-primary gJVM in the gJVM registry to a 
starting state value (step 508) , and then the primary 

20 gJVM responds to the requesting non-primary gJVM by 

sending a message containing the newly generated unique 
gJVM identifier (step 510) . The primary gJVM then 
broadcasts a message that contains the updated gJVM 
registry to all of the non-primary gJVM nodes (step 512) , 

25 thereby concluding the process. 

With reference now to FIG. 5B, a flowchart 
illustrates an alternative process at a primary gJVM for 
allocating a unique gJVM identifier to a non-primary gJVM 
that is being initialized. The process commences when 

30 the primary gJVM receives a request from a non-primary 
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gJVM to register a unique gJVM identifier (step 522) . In 
contrast to FIG. 5A, in which a non-primary gJVM node 
relies upon the primary gJVM node to allocate a unique 
gJVM identifier, FIG. 5B shows an exemplary system in 
5 which the non-primary gJVM node allocates its own unique 
gJVM identifier and then registers its identifier with 
the primary gJVM node. The non-primary gJVM node may 
generate its own identifier that is guaranteed to be 
unique through execution of an algorithm for such 

10 purpose. For example, many devices are manufactured to 

include a readable, unalterable, serial number, sometimes 
referred to as a global unique identifier; since each 
gJVM node within a distributed gJVM cluster operates on a 
device, the gJVM node may use an algorithm to create a 

15 unique gJVM identifier, wherein the algorithm accepts a 
device's global unique identifier as input. 

The primary gJVM then adds or registers the new gJVM 
identifier into the gJVM registry (step 524) . The 
primary gJVM sets the run state of the non-primary gJVM 

20 in the gJVM registry to a starting state value (step 
526) , and then the primary gJVM responds to the 
requesting non-primary gJVM by sending a response message 
with a status value that indicates whether or not the 
registration was successful (step 528) . The primary gJVM 

25 then broadcasts a message that contains the updated gJVM 
registry to all of the non-primary gJVM nodes (step 530) , 
thereby concluding the process. 



AUS920030872US1 

30 

Allocation of an Object Group ID to a Non-Priiaarv 
Grid-Enabled Virtual Machine Node 

With reference now to FIG. 6, a flowchart 
5 illustrates a process at a primary gJVM for allocating a 
unique object group identifier to a non-primary gJVM. 
The process commences when the primary gJVM receives a 
request from a non-primary gJVM for a unique object group 
identifier (step 602) . The primary gJVM generates the 

10 unique object group identifier (step 604) and adds the 
newly generated object group identifier to the object 
group registry (step 606) . The primary gJVM sets the 
migration state of the object group in the object group 
registry to a starting state value (step 608) , and then 

15 the primary gJVM responds to the requesting non-primary 
gJVM by sending a message containing the newly generated 
unique object group identifier (step 610) . The primary 
gJVM then broadcasts a message that contains the updated 
object group registry to all of the non-primary gJVM 

20 nodes (step 612), thereby concluding the process. 

Alternatively, in contrast to FIG. 6 in which a 
non-primary gJVM node relies upon the primary gJVM node 
to allocate a unique object group identifier, an 
alternative process may be employed in which the 

25 non-primary gJVM node allocates its own unique object 

group identifier and then registers this newly generated 
object group identifier with the primary gJVM node. The 
non-primary gJVM node may generate its own object group 
identifier that is guaranteed to be unique through 

30 execution of an algorithm for such purpose, e.g., in a 
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manner similar to that described above with respect to 
FIG. 5B. 

Providing Cluster -Specif ic Information to Non-Primary 
5 Grid-Enabled Virtual Machine Nodes 

With reference now to FIG. 7, a flowchart 
illustrates a process at a primary gJVM for providing 
information about the gJVM cluster to a non-primary gJVM. 
The process commences when the primary gJVM receives a 

10 request from a non-primary gJVM for cluster-specif ic 
information, such as the current gJVM registry, the 
current object group registry, or the cluster's 
environment/context parameters (step 702) . The primary 
gJVM responds to the requesting non-primary gJVM by 

15 sending a message containing the requested information 
(step 704), thereby concluding the process. In this 
manner, a non-primary gJVM node is able to obtain 
information from a primary gJVM node, assuming that an 
implementation of the present invention has a gJVM 

20 cluster in which a primary gJVM node manages 

cluster-specific information on behalf of non-primary 
gJVM nodes. 

Changing Run State of Non-Primary Grid-enabled Virtual 

25 Machine Node 

With reference now to FIG. 8, a flowchart 
illustrates a process at a primary gJVM for changing a 
run state of a non-primary g JVM . The process commences 
when the primary gJVM receives a request from a 

30 non-primary gJVM to change its run state (step 802) . The 
primary gJVM updates the gJVM registry with the requested 
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run state (step 804) and responds to the requesting 
non-primary gJVM with an acknowledgment message (step 
806) . The primary gJVM then broadcasts a message that 
contains the updated gJVM registry to all of the 
5 non-primary gJVM nodes (step 808) , thereby concluding the 
process . 

Changing Object Group Migration State by Non-Primary 
Grid- enabled Virtual Machine Node 

10 With reference now to FIG. 9, a flowchart 

illustrates a process at a primary gJVM for changing a 
migration state of an object group. The process 
commences when the primary gJVM receives a request from a 
non-primary gJVM to change a migration state of an object 

15 group (step 902) . The primary gJVM updates the object 
group registry with the requested migration state (step 
904) and responds to the requesting non-primary gJVM with 
an acknowledgment message (step 906) . The primary gJVM 
then broadcasts a message that contains the updated 

20 object group registry to all of the non-primary gJVM 
nodes (step 908), thereby concluding the process. 

Initialization of Non-Primary Grid-Enabled Virtual 
Machine Node 

25 With reference now to FIG. 10, a flowchart 

illustrates a process for initializing a non-primary gJVM 
node in accordance with an embodiment of the present 
invention. The process commences when the non-primary 
clustered virtual machine node is initialized (step 

30 1002), e.g., through the use of the instantiation command 
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that was described above. The gJVM cluster name is 
obtained, e.g., from the instantiation command (step 
1004), and a lookup operation is performed using the gJVM 
cluster name (step 1006), e.g., to obtain network 
5 parameters from a globally available database to initiate 
communication with the primary gJVM node for the 
identified gJVM cluster (step 1008) . 

The non-primary gJVM node requests and receives a 
unique gJVM identifier from the primary gJVM node (step 

10 1010), e.g., as shown in PIG. 5A or FIG. 5B. The 
non-primary gJVM node obtains any cluster-wide 
information, such as environment parameters or context 
parameters that are used by the gJVM cluster (step 1012) , 
e.g., as shown in FIG. 7, The non-primary gJVM node also 

15 requests and receives from the primary gJVM node the gJVM 
registry (step 1014) and the object group registry (step 
1016), e.g., as shown in FIG. 7. Although steps 
1010-1016 are illustrated as being implemented in a 
synchronous manner, the steps may be performed 

20 asynchronously in a variety of manners. Since the 

non-primary gJVM node has completed its initialization, 
the non-primary gJVM node sends to the primary gJVM node 
a request to change its run state from a value that 
indicates that it is in a starting state to a value that 

25 indicates that it is in a running state (step 1018) . 

Assuming that the request was successfully executed, the 
non-primary gJVM node receives an acknowledgment from the 
primary gJVM node (step 1020), thereby concluding the 
process. 
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Whereas FIG. 10 illustrates an initialization 
process at a non-primary gJVM node, FIGs . 5A-9 illustrate 
processes by which a primary gJVM node performs tasks as 
requested by non-primary gJVM nodes to centrally manage 
5 information on behalf of the gJVM cluster as a whole. 

The remaining figures depict processes through which the 
non-primary gJVM nodes manage object groups and threads 
within a gJVM cluster; in other words, the remaining 
figures illustrate processes from the perspective of a 

10 non-primary gJVM node. However, it should be noted that 
a primary gJVM node executes similar processes with 
respect to management of its object groups and threads; 
in those instances in which a figure illustrates a step 
in a process in which information is transferred to or 

15 received from a primary gJVM node, it should be 

understood that a primary gJVM node would execute a 
similar step except that the primary gJVM node could 
place information directly into cluster- specif ic tables 
or databases, such as the gJVM registry or the object 

20 group registry, without having to coordinate the step 
with itself, i.e., the primary gJVM node. 

Thread Construction in Grid-Enabled Virtual Machine Node 
With reference now to FIG. 11, a flowchart 

25 illustrates a process on a clustered virtual machine 
during which a thread is constructed. The process 
commences when a local gJVM receives a request to 
construct a thread (step 1102), e.g., through an 
appropriate API call by an application that is executing 

30 under the control of the clustered virtual machine, 

although it is possible for the local gJVM to perform a 
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similar process on its own initiative while it controls 
the execution of applications on the virtual machine. 
The local gJVM node, which in some cases may be assumed 
to be a non-primary gJVM, sends a request to the primary 
5 gJVM for a unique object group identifier and then 

receives it (step 1104) . The local gJVM constructs the 
thread (step 1106) , which may entail the creation of 
various thread management objects and the updating of 
various thread management tables. A unique thread 

10 identifier is generated and assigned to the newly 

constructed thread (step 1108) • The local gJVM then 
allocates a thread local heap (step 1110) and creates or 
demarcates an object group within the thread local heap 
(step 1112), which, depending on the implementation, 

15 preferably stores the thread-related objects. The newly 
created object group identifier is associated with the 
newly constructed thread (step 1114), e.g., by updating 
the appropriate managettit'ent tables . 

The local gJVM then sends a request to the primary 

20 gJVM to change the migration state of the object group 
from a value indicating a starting state, which was set 
by the primary gJVM when the new object group identifier 
was requested, to a value indicating a stable state (step 
1116) . The local gJVM then receives a migration state 

25 change acknowledgment from the primary gJVM (step 1118) . 
The local gJVM then returns a response for the thread 
construction, which may contain the thread identifier and 
other thread-related information, to the requesting 
application (step 1120), thereby concluding the process. 
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Thread Destruction in Grid-Enabled Virtual Machine Node 

With reference now to FIG, 12, a flowchart 
illustrates a process on a clustered virtual machine 
5 during which a thread is destroyed. The process 

commences when a local gJVM receives a request to destroy 
a thread (step 1202), e.g., through an appropriate API 
call by an application that is executing under the 
control of the clustered virtual machine, although a 

10 similar process may occur when the local gJVM decides to 
destroy a thread on its own initiative, e.g., due to 
various termination conditions. 

Assuming that the object groups that are associated 
with the terminating thread have external/remote/foreign 

15 references to objects within themselves, then the object 
groups need to be preserved, even though the thread that 
was originally the reason for creating the object groups 
is now being terminated or destroyed. Hence, the local 
gJVM disassociates the thread identifier of the 

20 terminating thread from each associated object group in 
an appropriate manner (step 1204) , which may include 
updating various thread management tables and object 
group management tables (step 1206) . If necessary, 
depending upon the memory allocation methods in the local 

25 gJVM, object groups may need to be moved from thread 

local heaps to general heaps within the local gJVM (step 
1208) . It may often be the case that terminating a 
thread results in almost all of its objects being garbage 
collected; in an alternative embodiment, objects that 

30 remain because they are remotely ref erenceable may be 
moved to a master global object group. 
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The thread is then destroyed (step 1210), e.g., by 
destroying the thread-related management objects that 
were used to control the thread. A response for the 
thread destruction operation is then returned to the 
5 requesting application (step 1212) , thereby concluding 
the process. 

Object Allocation in Grid-Enabled Virtual Machine Node 

With reference now to FIG. 13, a flowchart 

10 illustrates a process on a clustered virtual machine 
during which an object is constructed. The process 
commences when the local gJVM node receives a request to 
construct an object (step 1302) e.g., through an 
appropriate API call by an application that is executing 

15 under the control of the clustered virtual machine, 

although a similar process may occur when the local gJVM 
decides to create an object on its own initiative. The 
thread identifier of the requesting thread is determined 
(step 1304), and the object group information for an 

20 object group that is associated with the requesting 

thread is retrieved (step 1306) . The local gJVM then 
allocates an object in one of the requesting thread's 
object groups (step 1308), and an object identifier for 
the newly allocated object is generated and assigned to 

25 the object (step 1310) . An object reference/handle for 
the newly created object is created (step 1312), which 
may contain the object group identifier and the object 
identifier, depending upon the manner in which object 
references are implemented. The object reference/handle 
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is then returned to the requesting thread (step 1314) , 
thereby concluding the process. 

Object Referencing in Grid-Enabled Virtual Machine Node 
5 With reference now to FIG. 14, a flowchart 

illustrates a process on a clustered virtual machine for 
handling accesses to objects. The process commences with 
the local gJVM intercepting an attempt to access or 
reference an object via an object handle (step 1402) . 

10 The local gJVM retrieves the object group identifier from 
the object handle (step 1404) and determines whether the 
identified object group is managed locally or remotely 
(step 1406); e.g., by consulting an object group 
management table. If the object is stored locally, then 

15 the local gJVM obtains the pointer to the appropriate 

thread local heap (step 1408) and accesses the object in 
the thread local heap using the object identifier (step 
1410) . 

If the object is stored remotely, then the local 
20 gJVM obtains the identifier of the remote gJVM node of 
the object group in which the object is located (step 
1412) . The IP address and port ID of the remote gJVM is 
retrieved (step 1414), e.g., from the local copy of the 
gJVM registry. The local gJVM then sends an object 
25 access request to the remote gJVM (step 1416) , thereby 
concluding the process. 
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Migration of Object Group From Grid-Enabled Virtual 
Machine Node 

With reference now to FIG. 15, a flowchart 
5 illustrates a process on a local clustered virtual 
machine for migrating an object group to a remote 
clustered virtual machine. The process commences when 
the local gJVM node prepares to migrate an object group 
from the local gJVM node to a remote gJVM node (step 

10 1502) , which may entail sending a request message to the 
remote gJVM to migrate the object group. This 
preparation to migrate an object group may occur as a 
result of various conditions, which are described in more 
detail further below. For example, as described with 

15 respect to FIG, 19, prior to stopping a gJVM node, any 
object groups within the gJVM node would be migrated to 
other gJVM nodes with the gJVM cluster. 

The local gJVM sends a request to the primary gJVM 
to change the migration state of the migrating object 

20 group from stable to migrating (step 1504), after which 
the local gJVM receives a migration state change 
acknowledgment from the primary gJVM (step 1506) . The 
local gJVM then flattens or serializes each object in the 
migrating object group (step 1508) and pushes the 

25 flattened or serialized objects from the local gJVM to 
the remote gJVM in some manner such that the object 
remain associated with the object group (step 1510) , 
after which the local gJVM receives an acknowledgment 
from the remote gJVM (step 1512) . The local gJVM then 

30 destroys the object group and/or heap within the local 
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gJVM for the object group that has just been moved (step 
1514) while also clearing or updating the object group 
entry in the local object group table along with any 
other table or database entries that need to be cleared 
5 or updated within the local gJVM node (step 1516) because 
the object group is now located within a different gJVM 
node, thereby concluding the process. 

Migration of Object Group To Grid-Enabled Virtual Machine 

10 With reference now to FIG. 16, a flowchart 

illustrates a process on a local clustered virtual 
machine for accepting the migration of an object group 
from a remote clustered virtual machine. The process 
that is shown in FIG. 16 should be viewed as 

15 complementary to the process that is shown in FIG. 15; 
the local gJVM and the remote gJVM in FIG. 15 are the 
remote gJVM and the local gJVM in FIG. 16, respectively 
reversing their roles. 

The process commences when the local gJVM receives a 

20 request to migrate an object group from the remote gJVM 

to the local gJVM (step 1602), after which the local gJVM 
allocates thread local heap or other appropriate type of 
heap (step 1604) and receives the flattened or serialized 
objects in the object group from the remote gJVM (step 

25 1606) . The local gJVM reads or inflates the objects into 
the chosen location within the local gJVM (step 1608) and 
associates the received object group with a thread (step 
1610), if necessary, e.g., if a thread was migrated along 
with the object group. The local gJVM then performs the 

30 appropriate administrative tasks for managing the newly 
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received object group, such as creating an object group 
entry in the local object group table (step 1612). The 
local gJVM then sends a request to the primary gJVM to 
change the migration state of the migrating object group 
5 from migrating to stable (step 1614) , after which the 
local gJVM receives a migration state change 
acknowledgment from the primary gJVM (step 1616) . The 
local gJVM then sends an acknowledgment message to the 
remote gJVM that indicates the completion of the 
10 migration of the object group (step 1618) , thereby 
concluding the process . 

Migration of Thread From Grid-Enabled Virtual Machine 

With reference now to FIG. 17, a flowchart 

15 illustrates a process on a local clustered virtual 

machine for migrating a thread to a remote clustered 
virtual machine. The process commences when the local 
gJVM node prepares to migrate a thread from the local 
gJVM node to a remote gJVM node (step 17 02) , which may 

20. entail sending a request message to the remote gJVM to 

migrate the thread. This preparation to migrate a thread 
may occur as a result of various conditions, which are 
described in more detail further below. The local gJVM 
pushes the thread management information from the local 

25 gJVM to the remote gJVM (step 1704) ; the thread 

management information may comprise information in the 
local thread management tables within the local gJVM 
node, after which the local gJVM receives an 
acknowledgment from the remote gJVM (step 17 06) . The 

30 local gJVM then updates the local thread management 
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tables as necessary to reflect the fact that the thread 
has been moved to a different gJVM node (step 1708) , 
thereby concluding the process. 

5 Migration of Thread To Grid-Enabled Virtual Machine 
With reference now to FIG, 18, a flowchart 
illustrates a process on a local clustered virtual 
machine for accepting the migration of a thread from a 
remote clustered virtual machine. The process that is 

10 shown in FIG. 18 should be viewed as complementary to the 
process that is shown in FIG, 17; the local gJVM and the 
remote gJVM in FIG, 17 are the remote gJVM and the local 
gJVM in FIG. 18, respectively reversing their roles. 

The process commences when the local gJVM receives a 

15 request to migrate a thread from the remote gJVM to the 
local gJVM (step 1802), after which the local gJVM 
receives the thread management information for the thread 
from the remote gJVM (step 1804) . The local gJVM then 
performs the appropriate administrative tasks for 

20 managing the newly received thread, such as updating the 
local thread management tables (step 1806) . The local 
gJVM then sends an acknowledgment message to the remote 
gJVM that indicates the completion of the migration of 
the thread (step 1808), thereby concluding the process. 

25 

Shutdown of Grid-Enabled Virtual Machine 

With reference now to FIG. 19, a flowchart 
illustrates a process on a local clustered virtual 
machine for performing a shutdown operation on a 
30 clustered virtual machine. The process commences with 
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the determination that the local gJVM needs to perform a 
shutdown operation (step 1902). The shutdown operation 
may be the result of a determination within the local 
gJVM node to self -terminate for a variety of reasons; 
5 alternatively, the determination to shutdown the gJVM 
node may be controlled by the primary gJVM node, by an 
administrative utility, by an administrative command, or 
by some other mechanism. The local gJVM node sends a 
request to the primary gJVM node to change the run state 

10 of the local gJVM node from a value that indicates a 

running state to a value that indicates a stopping state 
(step 1904) , after which the local gJVM node receives an 
acknowledgment message from the primary gJVM (step 1906) . 
The local gJVM then begins to migrate all of its threads 

15 to other gJVM nodes (step 1908) while also migrating all 
of its object groups to other gJVM nodes (step 1910) . As 
discussed in more detail below, a gJVM node may perform 
the migration operations in accordance with information 
about the performance characteristics of other gJVM nodes 

20 within the gJVM cluster. The local gJVM node then sends 
a request to the primary gJVM node to change the run 
state of the local gJVM node from a value that indicates 
a stopping state to a value that indicates a stopped 
state (step 1912) , after which the local gJVM node 

25 receives an acknowledgment message from the primary gJVM 
(step 1914), thereby concluding the process. 

As mentioned above, a primary node can delegate or 
hand-off its responsibilities to a non-primary node in 
order to enable the primary node to shutdown for 

30 maintenance or for other purposes after the system has 
been started. If a primary node starts a shutdown 
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process, the primary node selects a non-primary node to 
act as the newly delegate primary node, possibly through 
a request/response mechanism. In this manner, the 
primary node performs additional operations that are not 
5 performed by a non-primary node during the period of time 
that the primary node is in a stopping state. Subsequent 
to the delegation operation, the newly delegated primary 
node may broadcast its new identity as the primary node 
for the virtual machine cluster to the other non-primary 
10 nodes within the virtual machine cluster, which would 

update their informational data structures accordingly. 

Performance Information Gathering bv Grid-Enabled Virtual 
Machine 

15 With reference now to FIG. 20, a flowchart 

illustrates a process on a local clustered virtual 
machine for gathering load or performance information 
from a remote clustered virtual machine. Th^ process 
that is shown in FIG, 20 performs an operation with 

20 respect to all of the known remote gJVM nodes in an 

iterative manner. Hence, the process commences with the 
local gJVM node obtaining information from the local copy 
of the gJVM registry about a next remote gJVM node that 
has not yet been processed within the iterative loop 

25 (step 2 002) . The local gJVM then sends a request message 
to the identified remote gJVM node for its load 
information (step 2004), after which the local gJVM 
receives a response message that contains the load 
information for the remote gJVM node (step 2006); the 

30 information may be stored within a local table or 

database for subsequent analysis, e.g., using the process 



AUS920030872US1 

45 

that is described with respect to FIG. 22 hereinbelow. 
The local gJVM node then checks if there are any- 
unprocessed gJVM nodes within the local gJVM registry 
(step 2008) . If so, then the process branches back to 
5 step 2 002 to process another remote gJVM node; if there 
are not any unprocessed remote gJVM nodes, then the 
process is concluded. 

Performance Information Gathering by Grid-Enabled Virtual 
10 Machine 

With reference now to FIG. 21, a flowchart 
illustrates a process on a local clustered virtual 
machine for responding to a request from a clustered 
virtual machine for the current load or performance 

15 information at the local clustered virtual machine. The 
process that is shown in FIG. 21 should be viewed as 
complementary to the process that is shown in FIG. 20; 
the local gJVM and the remote gJVM in FIG. 20 are the 
remote gJVM and the local gJVM in FIG. 21, respectively 

20 reversing their roles. The process commences when the 
local gJVM receives a request message for the load 
information on the local gJVM and/or the local device 
(step 2102) . For example, the load information may 
include values that indicate CPU utilization, memory 

25 utilization, other hardware, software, or computational 
resource utilization information for the data processing 
device on which the local gJVM node is operating, 
computational resource utilization information for 
resources within or controlled by the gJVM, and other 

30 information. The local gJVM then sends a response 
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message containing the requested load information (step 
2104), thereby concluding the process. 

Load Balancing Operation Between Grid-Enabled Virtual 
5 Machine Nodes 

With reference now to FIG. 22, a flowchart 
illustrates a process on a local clustered virtual 
machine for determining to migrate threads and/or object 
groups from the local clustered virtual machine to a 

10 remote clustered virtual machine in accordance with load 
or performance information. The process commences when 
the local gJVM node determines to start a load balancing 
operation (step 2202) ; if appropriate, the local gJVM 
node obtains or checks its own load information (step 

15 2204) . The local gJVM also obtains the load information 
from other gJVM nodes within the gJVM cluster (step 
2206), e.g., using the processes that are described above 
with respect to FIG. 20 and FIG. 21, although the load 
information from the other gJVM nodes might be gathered 

20 only after a determination has been made to initiate the 
load-balancing operation. 

The load balancing operation may be performed in 
order to ensure that the workload caused by executing 
various applications within the cluster is distributed 

25 throughout the gJVM cluster. The load balancing 

operation may be the result of a determination within the 
local gJVM node to initiate a load balancing operation in 
accordance with an algorithm that might be employed by 
all gJVM nodes within a gJVM cluster; alternatively, the 

30 determination to perform the load balancing operation may 
be controlled by the primary gJVM node, by an 
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administrative utility, by an administrative command, or 
by some other mechanism. For example, the local gJVM 
node may perform the load balancing operation after it 
has constructed a thread and is ready to dispatch the 
5 thread; alternatively, to save computational resources, 
the load balancing operation might be performed 
periodically, e.g., after every tenth newly constructed 
thread. 

The local gJVM node then begins to evaluate its load 

10 information to check whether or not some resource on the 
local gJVM node has exceeded a configurable threshold 
value, e.g., as indicated within cluster-specific context 
parameters. In the example that is shown in FIG. 22, two 
evaluations are performed; the number of evaluations that 

15 are performed may depend on the load-balancing algorithm, 
the available load information, or some other factor. 

A determination is made as to whether or not the CPU 
load value for the local gJVM node exceeds a CPU-load 
threshold value (step 2208) ; if not, then the process 

20 branches to step 2214. If the CPU load value for the 

local gJVM node exceeds a configurable CPU- load threshold 
value, then the local gJVM node selects a remote gJVM 
node with available CPU resources (step 2210), e.g., by 
analyzing the load information that was previously 

25 gathered from the remote gJVM nodes with respect to 

configurable load threshold values. The local gJVM node 
then begins a migration operation to move threads from 
the local gJVM node to one or more selected remote gJVM 
nodes (step 2212), e.g., using the processes that are 

30 described above with respect to FIG. 17 and FIG. 18. 
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A determination is then made as to whether or not 
the memory load value for the local gJVM node exceeds a 
memory-load threshold value (step 2214) ; if not, then the 
process is concluded. If the memory load value for the 
5 local gJVM node exceeds a configurable memory- load 
threshold value, then the local gJVM node selects a 
remote gJVM node with available memory resources (step 
2216), e.g., by analyzing the load information that was 
previously gathered from the remote gJVM nodes with 

10 respect to configurable load threshold values. The local 
gJVM node then begins a migration operation to move 
object groups from the local gJVM node to one or more 
selected remote gJVM nodes (step 2218), e.g., using the 
processes that are described above with respect to FIG. 

15 16 and FIG. 16, thereby concluding the process. 

Multi-Threaded Application on Locfical Virtual Machine 

With reference now to FIG. 23, a block diagram 
depicts the execution of a single application image that 

20 spans multiple grid-enabled virtual machines in a single 
logical virtual machine over multiple devices. Machines 
2302-2306 support logical virtual machine 2308 and 
logical virtual machine 2310. 

Logical virtual machine 2308 includes only a single 

25 node that is represented by primary gJVM node 2312, which 
may have been invoked through a command line interface 
that specified ''GRID_B" as the cluster name for logical 
virtual machine 2308. Logical virtual machine 2308 is 
supported. upon a single device, i.e., machine 2306. 
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In contrast, logical virtual machine 2310 includes 
multiple gJVM nodes 2314-2320 that are supported by 
multiple devices, i.e., machines 2302-2306; machine 2302 
happens to be concurrently supporting two gJVM nodes 2314 
5 and 2320. Virtual machines 2314-2320 interoperate as a 

single logical virtual machine. In this example, virtual 
machines 2314-2320 are implemented using Java' technology; 
hence, virtual machines 2314-2320 may be described as 
Java* virtual machines. Since virtual machines 2314-2320 

10 have embedded functionality for interoperat ing in a 

grid-like manner as a logical virtual machine, virtual 
machines 2314-2320 may be described as grid-enabled Java* 
virtual machines (gJVM's), each of which is a node within 
logical gJVM 2310. 

15 Each of gJVM nodes 2314-2320 may have been invoked 

through a command line interface that specified "GRID_A" 
as the cluster name for logical gJVM 2310. Nodes 
2316-2320 are non-primary nodes; gJVM 2314 was designated 
as the primary node at startup, although it may delegate 

20 its status as primary node to another node within logical 
gJVM 2310 at a later point in time. In this example, 
nodes 2314-2320 are unique instances of the same gJVM 
application program. Using the methods that are 
described above, gJVM nodes 2316-2320 register into the 

25 cluster/grid of nodes that is managed by primary gJVM 

node 2314; remote references to objects and coordination 
between the gJVM nodes are provided through the gJVM-gJVM 
(peer-to-peer) communications links that the gJVM nodes 
are able to establish. 



AUS920030872US1 

50 

With the present invention, multiple gJVM nodes 
interoperate as a logical gJVM so that an application 
program that starts on one gJVM node may subsequently 
execute in a manner that spans multiple gJVM nodes. In 
5 the example in FIG. 23, an application that includes main 
thread 2322 has been started on gJVM node 2318, and one 
of its secondary threads 2324 also executes on gJVM node 
2318. In addition, secondary threads 2326-2332 of the 
same application execute on gJVM nodes 2314 and 2316 

10 within logical gJVM 2310. 

The present invention allows nodes 2314-2320 to 
share the workload of Java® application 2334 that is 
executing on logical gJVM 2310 without requiring Java* 
application 2334 to deviate from the standard Java* 

15 application model; in other words, Java* application 2334 
does not require any proprietary functionality, and gJVM 
nodes 2314-2320 do not require proprietary application 
agents to act on behalf of Java* application 2334 across 
multiple JVM's. With the present invention, gJVM nodes 

20 2314-2320 have embedded functionality for interoperating 
in a grid-like manner to execute Java* applications. 

Object groups 2336-2346 are distributed throughout 
logical gJVM 2310. Any given application thread may 
reference objects that exist in several object groups, 

25 whether or not the application thread or another thread 
within the same application constructed or allocated 
those object groups. Hence, a given thread may reference 
an object in a local object group or an object in a 
remote object group, whether or not the object group is 

30 associated with the given thread or another thread on the 
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local gJVM node or on a remote gJVM node. If fact, the 
object group may not be associated with any extant 
thread, or the object may be contained in a so-called 
global object group on the local gJVM node or on a remote 
5 gJVM node. In the example that is shown in FIG. 23, 

thread 2330 is shown as referencing an object or objects 
in object groups 2336 and 2346 within local gJVM node 
2314 while also referencing an object or objects in 
object group 2338 within remote gJVM node 2316. 

10 The advantages of the present invention should be 

apparent with reference to the detailed description that 
is provided above. Applications that execute on virtual 
machines may be run in a grid-like manner over a 
plurality of virtual machines, thereby enabling 

15 computational resources to be provided as a utility-like 
service for compensation. When data processing systems 
and devices are shutdown, the clusterable virtual 
machines on those systems and devices may be stopped; 
threads and objects on those systems and devices may be 

20 migrated to other systems and devices while the virtual 
machine cluster allows any applications to continue to 
run, albeit on a set of different systems and devices. 

It is important to note that while the present 
invention has been described in the context of a fully 

25 functioning data processing system, those of ordinary 
skill in the art will appreciate that the processes of 
the present invention are capable of being distributed in 
the form of instructions in a computer readable medium 
and a variety of other forms, regardless of the 

30 particular type of signal bearing media actually used to 
carry out the distribution. Examples of computer 
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readable media include media such as EPROM, ROM, tape, 
paper, floppy disc, hard disk drive, RAM, and CD-ROMs and 
transmission- type media, such as digital and analog 
communications links . 
5 A method is generally conceived to be a 

self -consistent sequence of steps leading to a desired 
result. These steps require physical manipulations of 
physical quantities. Usually, though not necessarily, 
these quantities take the form of electrical or magnetic 

10 signals capable of being stored, transferred, combined, 

compared, and otherwise manipulated. It is convenient at 
times, principally for reasons of common usage, to refer 
to these signals as bits, values, parameters, items, 
elements, objects, symbols, characters, terms, numbers, 

15 or the like. It should be noted, however, that all of 
these terms and similar terms are to be associated with 
the appropriate physical quantities and are merely 
convenient labels applied to these quantities. 

The description of the present invention has been 

20 presented for purposes of illustration but is not 

intended to be exhaustive or limited to the disclosed 
embodiments. Many modifications and variations will be 
apparent to those of ordinary skill in the art. The 
embodiments were chosen to explain the principles of the 

25 invention and its practical applications and to enable 
others of ordinary skill in the art to understand the 
invention in order to implement various embodiments with 
various modifications as might be suited to other 
contemplated uses . 



